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ABSTRACT 

Template fits to observed galaxy fluxes allow calculation of if-corrections and conversions 
among observations of galaxies at various wavelengths. We present a method for creating model- 
based template sets given a set of heterogeneous photometric and spectroscopic galaxy data. 
Our technique, non- negative matrix factorization, is akin to principle component analysis (PC A), 
except that it is constrained to produce nonnegative templates, it can use a basis set of models 
(rather than the delta function basis of PC A), and it naturally handles uncertainties, missing 
data, and heterogeneous data (including broad-band fluxes at various redshifts). The particular 
implementation we present here is suitable for ultraviolet, optical, and near-infrared observations 
in the redshift range < z < 1.5. Since we base our templates on stellar population synthesis 
models, the results are intepretable in terms of approximate stellar masses and star-formation 
histories. We present templates fit with this method to data from GALEX, Sloan Digital Sky 
Survey spectroscopy and photometry, the Two-Micron All Sky Survey, the Deep Extragalactic 
Evolutionary Probe and the Great Observatories Origins Deep Survey. In addition, we present 
software for using such data to estimate if-corrections and stellar masses. 

Subject headings: galaxies: fundamental parameters — galaxies: photometry — galaxies: statis- 
tics 



1. Motivation 

New surveys at low and high redshift have provided us with estimates of galaxy spectral energy distri- 
butions (SEDs) for an enormous number of galaxies. When comparing populations of galaxies at different 
redshifts in these surveys, we need to use comparable measurements of the galaxy SEDs. However, different 
surveys use different bandpasses and the restframe wavelengths of these bandpasses necessarily vary with 
redshift. We need to be able to handle this heterogeneity in order to make sensible comparisons among all 
of these new surveys. 

In this paper, we present a method for doing so, by calculating /-C-corrections between observed and 
desired bandpasses. The i(r-correction between a bandpass R used to observe a galaxy at redshift z and the 
desired bandpass Q is defined by the equation (Oke & Sandage 1968; Hogg et al. 2002): 

ruR = Mq + DM(z) + Kqr{z) - 5 log^o h (1) 
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where DM(2:) = 25 — 5 log^^^ {dL/{h~^ Mpc)) is the bolometric distance modulus calculated from the lumi- 
nosity distance (II, and Mq is the absolute magnitude. The absolute magnitude is defined as the apparent 
magnitude an object would have if were observed 10 pc away, in bandpass Q, at rest. The traditional 
definition of the /C-correction takes Q = R. However, we note that in practice many surveys do perform 
iV'-corrections from one observed bandpass R to another bandpass Q in the rest frame. This practice is par- 
ticularly common when dealing with high redshift observations. In addition to /sT-corrections, this method 
also provides an interpretation of the data in terms of a physical model which describes the stellar mass and 
star-formation history of each galaxy. 

The method is designed to work well for a wide range of data sets. It uses photometry and spectroscopy 
of the Sloan Digital Sky Survey (SDSS; York ct al. 2000) the Galaxy Evolution Explorer (GALEX; Martin 
et al. 2005) in the ultraviolet, and the Two-Micron All Sky Survey (2MASS; Skrutskie et al. 1997) in the near 
infrared (NIR). In addition, at higher redshifts, we use constraints from the Deep Extragalactic Evolutionary 
Probe 2 (DEEP2; Davis et al. 2003; Faber et al. 2003) and the Great Observatories Origins Deep Survey 
(GOODS; Giavalisco ct al. 2004). These and other data sets provide a huge set of information about galaxy 
colors and spectra which we can use to help midcrstand their star-formation histories. 

We note that some of the algorithmic techniques used here may have applications in other areas of 
astrophysics. First, nonnegative matrix factorization (NMF; Lee & Seung 2000), and the extensions to it we 
describe here, is a particularly useful variant on principal component analysis. Second, the nonnegative least 

squares algorithm of Sha et al. (2002) has the virtue of being extremely simple to implement. These methods 
may find other applications in imago analysis, spectroscopic analysis, and model fitting in astrophysics. 

We also release the templates in electronic form as well as an implementation of the methods used 
to fit the templates to data. This software kcorrect v4_l is distributed on the World Wide Web^. It 
consists of a core C library which performs most of the complex and computationally intensive tasks, plus 
an IDL library that provides a high-level interface. This software is an update of two major earlier releases 
(vl_16; Blanton et al. 2003; v3_2). The improvement over the previous version is twofold. First, we have 
improved the templates such that they successfully fit galaxies in the restframe UV, as observed by GALEX, 
DEEP2, and GOODS. Second, because the templates arc now completely model-based, the fits have a 
physical interpretation in terms of a star-formation history. The IDL library also has a number of new useful 
functions. 

In Section 2, we describe how to find suitable templates, given a set of models and a set of data. We 
also describe the data and models used here. In Section 3, we show the results: the best-fit templates and 
how well they fit the data. In Section 4, wo describe how to convert our results into an estimate of the 
i^T-correction. In Section 5, we describe the physical interpretation of the templates and of fits to data. In 
Section 6, we present some simplified, linear transformations between various bandpasses. In Section 7, we 
summarize our results. Appendix A describes the NMF algorithm. Appendix B describes the electronic 
format of our results. 

Where necessary, we have assumed cosmological parameters fig = 0.3, Cl\ = 0.7, and Hq = 100 h km 
s~^ Mpc^^ (with h = 1), imless otherwise noted. All magnitudes are (unless otherwise noted) AB-relative. 
Except where noted, I will always be referring to the version of the kcorrect software labeled v4_l_4. 
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2. Finding the templates 
2.1. Overview 

From the spectroscopic observations we know that galaxy spectra reside in a low-dimensional subspace. 
Principal Component Analysis (PCA) of galaxy spectra, introduced by Connolly et al. (1995) and applied 
many times since then, demonstrates that most of the variance in the distribution of galaxies in spectral 
space can be explained using a few templates. This means that, in the linear space of all possible spectra, 
galaxies exist in only a small subspace. Therefore, even with very heterogeneous data, we should be able to 
determine the properties of this subspace. 

Here we present an approach to combining heterogeneous data in order to determine the properties of 
the subspace of galaxy spectra. Rather than taking the model-free approach used by PCA, we here restrict 
the space of possible spectra to those predicted from the high resolution stellar population synthesis model 
of Bruzual & Chariot (2003) and the nebular emission line models of Kewley et al. (2001). This approach 
both constrains the problem appropriately and yields a natural theoretical interpretation of the results in 
terms of star-formation histories. 

In a nutshell, our algorithm does the following. Given the observations (and uncertainties of those 

observations) available for each galaxy, it finds the nonncgativc linear combination of N template star- 
formation histories which best predict those observations in the sense. Given the entire set of galaxy 
observations available, it also fits for the N template star-formation histories. The technical name for this 
algorithm is Nonnegative Matrix Factorization (NMF; Lee & Seung 1999, 2000), and we describe it in detail in 
Appendix A. Note that this problem is not the same as the well-known nonnegative least squares problem, 
which would fit for the best-fit nonnegative linear combination of templates, but not for the form of the 
templates themselves. 

This approach is similar to PCA in that it finds the small spectral subspace in which galaxies exist, and 
can in some ways be thought of as "nonnegative PCA." However, our method has several advantages over the 
standard PCA approach. First, the results yield, along with a subspace in the space of all possible spectra, 
a natural interpretation, which is the corresponding subspace in the space of all possible star-formation 
histories. Second, it naturally handles data uncertainties and missing data, which allows it to ignore that 
variation which is due purely to statistical errors. Third, it handles the complications of observing galaxy 
spectra photometrically using broad-band filters of galaxies at varying redshifts, whereas standard PCA 
requires a constant grid of observations in rest-frame wavelength space. 

In the following subsections, we introduce the basis set of models we use here (§2.2), the data we fit to 
(§2.3), and how we combine the two to set up the NMF problem (§2.4). In Appendix A we describe how to 
actually solve to NMF problem. 

2.2. The models 

We begin with a basis set of 485 spectral templates. Of these, 450 are a set of instantaneous bursts 
from Bruzual & Chariot (2003), using the Chabrier (2003) stellar initial mass function and the Padoval994 
isochrones. We select 6 metallicities (0.005, 0.02, 0.2, 0.4, 1, and 2.5 times solar). For each metallicity 
we select 25 ages (between 1 Myr and 13.75 Gyr, spaced almost logarithmically in age). For each age and 
metallicity we make 3 choices of dust model: (1) no dust extinction; (2) ry = 3 dust with Milky Way type 



-4- 



extinction (as input into the models of Witt & Gordon 2000 with a "homogeneous" distribution and "shell" 
geometry); (3) Ty = 3 dust with SMC type extinction (from the same models). We smooth each template to 
300 km s~^ velocity dispersion (we will smooth the SDSS spectra in the training set to the same resolution). 

The remaining 35 templates are from MAPPINGS-III (Kewley et al. 2001) models of emission from 
ionized gas. We choose the predictions for an 8 Myr old continuous star- formation history with 5 possible 
metalliciticites (0.5, 0.2, 0.4, 1.0, and 2. times solar) and 7 possible ionization parameters (g = 5 x 10^, 
10"^, 2 X 10^, 4 X 10^, 8 X 10"^, 1.5 x 10®, and 3 x 10® cm s"^). We take the spectra (given as a set of delta 
functions) and smooth them to 300 km s~^ velocity dispersion. The one alteration we make to all of these 
templates is to remove Lya, because it is generally much larger than observed in real galaxies. 

Let us refer to these 485 basis templates as Mj{X), expressed in units of ergs s~^ A~^. 

We seek to reduce this full basis space to a subspace in which galaxies actually exist. In particular, we 
seek five templates Fi (A) built from nonnegative combinations of the original basis set of N templates: 

Fi{X)=J2KMjiX), (2) 
j 

in units of ergs s~^ A~^. In principle, we could seek any number of templates; from our experiments, we 
have found that five turns out to be a number large enough to explain the data wc use here. For each galaxy 
k we want our model Fx (A) for their spectrum to bo a noncgativc sum of these five templates: 

Fk{\) =Y,(^kiFi{X). (3) 



2.3. The data 

The training set consists of: 

1. SDSS spectroscopic data in the observed range 3800 < A < 9000 A, for 400 Luminous Red Galaxies 
between 0.15 < z < 0.5 (LRGs; Eisenstein et al. 2001) and 1600 Main sample galaxies between 
0.001 <z <0A (Strauss et al. 2002) 

2. SDSS photometric data on an independent set of LRGs {griz photometry only) and Main sample 
galaxies (using the full ugriz photometry) in the same redshift ranges. We use 2000 LRGs and 7000 
Main sample galaxies. For these galaxies we include 2MASS JHKg extended source catalog data 
(Jarrett et al. 2000) where available. 

3. GALEX DRl far UV (~ 1500 A) and near UV (~ 2300 A) photometry for Main sample SDSS galaxies 
with redshifts and ugriz photometry (4000 galaxies; Martin et al. 2005). 

4. The BRI photometry for high redshift galaxies in the DEEP2 DRl release between 0.6 < ^; < 1.5 
(4000 galaxies; Davis et al. 2003; Faber et al. 2003). 

5. The BVizJHKs photometry for GOODS galaxies between 0.5 < ^; < 2 (1000 galaxies; Giavalisco et al. 
2004). 

The SDSS, 2MASS, and GALEX galaxies were selected from and the matches were obtained by the New 
York University Value-Added Galaxy Catalog (NYU-VAGC; Blanton et al. 2005). 
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The SDSS data processing consists of astrometry (Pier et al. 2003); source identification, deblending and 
photometry (Lupton et al. 2001); photometricity determination (Hogg et al. 2001); calibration (Fukugita 
et al. 1996; Smith et al. 2002); spectroscopic target selection (Eisenstein et al. 2001; Strauss et al. 2002; 
Richards et al. 2002); spectroscopic fiber placement (Blanton et al. 2003); and spectroscopic data reduction. 
We recalibrated our photometry using the "ubcrcalibration" procedure described in Blanton et al. (2005). 
Descriptions of these pipelines also exist in Stoughton et al. (2002). An automated pipeline called idlspec2d 
(in this case v4_9) measures the redshifts and classifies the reduced spectra (Schlegel et al., in preparation). 
We use the Petrosian magnitudes for the SDSS data (except where noted below for the LRG-only sample). 

For the GALEX and GOODS data we use "auto" magnitudes, which are the Kron-like magnitudes from 
SExtractor (Bcrtin & Arnouts 1996). For 2MASS we use the "extrapolated" magnitudes from the Extended 

Source Catalog (Jarrett et al. 2000). 

Note that there arc 20 different broadband photometric filters listed above {B in DEEP2 is different 
than the B used by GOODS). The kcorrect product described in Section 7 contains a tabulation of the 
response functions for all of these filters. 



The data consist of spectra and bandband photometric measurements of galaxies at a number of red- 
shifts, and we have to relate the models to these measurements. 

For the spectra, we take the observed spectrum /^(A) (in ergs cm~^ s~^ A~^) of each galaxy k at 
redshift z (corresponding to a luminosity distance ^^(z) in cm; see Hogg 1999) and calculate the restframe 
luminosity per unit wavelength: 



That is, the spectrum is shifted due to the redshift, while the integral of the numerator over wavelength (the 
total luminosity) is constant with redshift, and the total fiux is related to the total luminosity by the inverse 
square law. In addition, we smooth each spectrum by such an amount that, given its estimated velocity 

dispersion, its total velocity dispersion after smoothing is 300 km s~"^ (there are a small number of galaxies 
with larger velocity dispersions, which we leave imchanged). 

Note that if we discretize the spectra to wavelengths A; , the relationship between the predicted SED for 
a galaxy k and the basis set Mji , becomes simply: 



If one has a spectrum for a galaxy the expression for the contribution to from each wavelength A; in 
the spectrum is then quite simple: 



2.4. Comparing data and models 



i^fc(A) =/fc[A(l + z)](l + z)(47rdi), 



(4) 




(5) 



Xki — 



(6) 
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Comparing observed broadband flux measurements is a bit more complicated, because it is a projection 
of the spectrum onto the broadband filter p at the observed redshift z. Instead of adjusting the observed 
fluxes as we could so easily do for the spectra above, for the photometry we express the models in terms of 
predicted broadband fluxes at each redshift. 

Here we express the flux for each galaxy k in units of AB maggies /Xp, which are defined as what one 
would measure in bandpass p relative to the AB standard source. For example, if we transform our spectral 
energy density basis function Mj{X) to a flux density mj{X), we can calculate the contribution of that basis 
function to the predicted maggies as: 

_ J^^ dXXRp{X)m,{X) 

dXXR,{X)f ab{X) ^ ' 

Here, the response function Rp{X) is proportional to the contribution to the detector signal of a photon with 
wavelength A entering the Earth's atmosphere (or entering the telescope for a space telescope). The AB 
standard source is fAB{X)dX = fAB{v)dv and /ab(2^) = 3631 Jy = 3.631 x IQ-^o erg s'^ cm'^ Hz'^ Of 
course maggies /U are related to magnitudes m as: 

m = -2.51ogioM, (8) 
such that the AB standard source would (if it existed) have = 1 and m = for all bandpasses. 

In this context it is worth noting that many authors (e.g. Bessell 1990) tabulate the contribution to 

the detector signal per unit of energy in photons of wavelength A instead of per photon with wavelength A. 
We will refer here to the former quantity as i?|/A), though in the literature it is often referred to without a 
prime (and without any explicit definition!). Clearly Rp{X) oc i?p(A)/A, since the higher the frequency of the 
photon, at a fixed response per unit energy there is a higher response per photon. With this substitution, 
one can rccxpress Equation 7 appropriately in terms of _Rp(A). Generally, though not universally, authors 
tabulate R'piX) for bandpasses whose standards were originally calibrated using energy-counting devices 
rather than photon-counting devices. However, from the point of view of the analysis of the observations 
it is irrelevant what the devices used for the standards and the observations are, as long as one calculates 
Rp{X) and uses Equation 7. 

The prediction for the broadband fluxes from Equation 7 is only for a specific redshift z. It turns out 
to simplify our mathematics to calculate the projection of each basis function j onto each filter p for a grid 
of redshifts (in this case spaced by 0.005 between redshifts and 2). Thus, below we will take p to index all 
the filters at all such redshifts. 

Just as before, we can now write down the relationship between the predicted broadband flux and the 
basis set fijp, in the bandpass and redshift corresponding to the index p, for a galaxy k: 



f^kp — / J ^ki^ijVjp (9) 



The contribution to the total of a broadband flux is therefore just: 

2 



l^kp f^kp 



(^kp 

t^kp ^^ij ^ki^ijVjp 
<^lp 



(10) 
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For a given galaxy wc do not have every filter, and we only have an observation of each filter at a single 
redshift. We pick the closest redshift on the redshift grid, and use the measured filters at that redshift for 
our expression of x^, and set l/c^y = for the rest of the values of the index p so that we zero- weight those 
predictions. 

The matrices r}jp (related to the broad-band fluxes) and Mji (related to the spectra) are totally fixed, 
and we combine them into a single matrix Afj„ and the indices p and I into a single index n, to handle 
both the broad-band fluxes and spectra simultaneously. Wc can similarly combine our observations fXkp and 
Ffe(Ai) and their uncertainties Okp and (Jk{\i) into vectors xtn and Ukn- Then we can combine the Equations 
6 and 10 into a single equation: 



There is a simple method that we describe in Appendix A called nonncgativc matrix factorization (NMF) to 
iterate to the nonnegative aki and hij which locally minimize Equation 11. The basic method is implemented 
in a public piece of code named NMF .SPARSE in the idlutils distribution of IDL utilities.^. As the name 
implies, our implementation takes advantage of the fact that many of the matrix operations are on very 
sparse matrices (for example, for each galaxy with photometric data there are no spectroscopic data points 
and only photometric data at a single redshift). 

As we describe in Appendix A, the NMF problem is not convex: that is, there are multiple local 
minima. Our method finds one of the local minima, but is not guaranteed to find the global minimum. 
Thus, the reader who tries to reproduce our results using our methods or others may (depending on their 
initial conditions) find different local minima of than do we. 

Once wc have fit for hij using the training set, we can minimize Equation 11 for any other galaxy using 
any nonnegative least squares algorithm to determine Oki (since the minimization of Equation 1 1 has a linear 
form in that case). When we do so here we use the beautifully simple iterative method of Sha et al. (2002). 



We begin with the simplest case, which is fitting a single template to the photometric data of the LRG 
sample; (Eisenstein et al. 2001). For many of these galax;ies, which extend to r < 19.5 and are intrinsically 
red, the u-band flux is extremely poorly measured, so we ignore the M-band for all LRGs. 

Figure 1 shows the spectrum of the best-fit LRG. This template is constrained by data between about 
2000 A and 10000 A; outside that range it is an extrapolation. 

Figure 2 shows the star-formation history corresponding to this best-flt LRG spectrum. The top panel 
shows the star-formation per unit time as a function of look-back time. The bottom panel shows the mean 
metallicity of the forming stars as a function of time. When considering these plots and those below, do 

remember that although these results arc our best-fit, the parameters are highly degenerate (especially in, 
for example, neighboring age bins). We show these merely to illustrate the general nature of the fits. 




(11) 



3. 



Results 



3.1. An example: the Luminous Red Galaxy templates 



^http : / /skymaps . info 
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Figure 3 shows the LRG colors as a function of redshift, with the best fit template color overplotted as 
the smooth, thick line. In the code described in Section 7, we used the routine sdssJjcorrect (with the 
optional flag /Irg) to perform these fits. The best fit is a good fit to the data. 

Note that in this example we have fit a nonevolving template, which is inappropriate over this range of 
redshifts — even if it is a good fit to the colors! In principle, we can adjust the methods used here for the 
case of evolving templates, but we will not do so here. 

In Appendix B we describe the form in which we release the star-formation histories and spectra asso- 
ciated with this template. 

3.2. Templates for the full data set 

We next apply the method to the full data set and fit for five templates. Prom experiments, we found 
that we needed five to adequately explain the data. More than five templates would allow us to explain 

the data better, but only marginally so. The right-most column of Figure 4 shows the spectrum from the 
ultraviolet through the near infrared for the resulting templates. Note that there is a very old template, a 
very young template, and several intermediate templates, including one which is close to that of an A star. 

The left and middle columns of Figure 4 show the star-formation histories and metallicities associated 
with the templates. We have not imposed any smoothness criterion on these fits, which explains the ragged 

appearance of these histories. The details of these fits arc weakly constrained due to degeneracies within 
and among the templates, but we show them here for completeness. 

In all the results below, we use these five templates, unaltered. That is, when we speak below of "fitting" 
the templates, we mean we fix bij to the five templates shown in this section and fit only for Ofcj. Since 
we have determined these templates to be appropriate for the data sets we include in the training set, they 
end up providing good fits to most other galaxies from the parent data sets. In general below, the tests we 
perform are not to the training set but to independent sets of galaxies. 

In Appendix B we describe the form in which we release the star-formation histories and spectra asso- 
ciated with these templates. 

3.3. Explanatory power of templates 

These templates explain the photometric data rather well. For example, consider Figure 5, showing the 
color residuals of the observations with respect to the best fits, when fitting to galaxies with GALEX, SDSS, 
and 2MASS data. In this context we define the color residuals as (for example): 

A[U -g] = Kbs - 5obs] - ["model " 5model]- (12) 

In the code described in Section 7, we used the routine gst_k;correct to perform these fits. In Figure 5 the 
thick dashed lines show the estimated Icr uncertainties in the colors from the photometric catalogs. Relative 
to the uncertainties, there are no significant biases or redshift trends in these fits. Note that the galaxies 
used in this test are distinct from the training set on which we fit the five templates. 

The templates do well on higher redshift data, as well. For example, Figure 6 shows the color residuals 
for GOODS data. Here we find that there are somewhat larger residuals with respect to the uncertainties 
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(again shown as the thick dashed hnes). We can compare this result to that in Figure 7, which we get 
by fitting a new set of templates to only the GOODS data (this alternate set of five templates is also 
distributed in the manner that Appendix B describes). From Figure 7, we conclude that some of the errors 
are irreducible, and require either a larger set of templates for high redshift galaxies (a possibility we find 
a priori unlikely), or that there arc simply errors in the catalogs or the input filter curves. For example, in 
the z and V bands we find significant trends between redshifts z = 0.5 and 1.5 (echoed in Figure 6). More 
significantly, in the H band there is at least a 10% offset in the magnitudes across all redshifts, which is 
almost certainly a catalog or filter curve error. Other errors (such as a large scatter in the residuals in the 
B band and some redshift dependence in the J band) are clearly due to the fact that the templates are not 
primarily designed for GOODS data. All in all, we recommend using the special templates for /C-corrections 
within the GOODS data set. 

We can also test how well these templates recover actual spectra given only photometry. To do so, we 
take eight random SDSS spectra (chosen to span color space), project them onto the g, r and i bandpasses, 

and then fit the five templates to just these three fluxes. Figure 8 compares the reconstructed model spectra 
to the original spectrum. The two agree well in general, though some features (like the emission lines) are not 
reproduced well. Naturally, this success is mostly due to the quality of the original stellar spectra used in the 
population synthesis code of Bruzual & Chariot (2003). We are not suggesting that gri photometry contains 
as much information as the full spectra. However, these results encourage us that by using broad-band data 
we can infer JC-corrections of galaxies. 

3.4. Predictive power of templates 

Perhaps more interestingly, the templates do a good job of predicting missing data. That is, we can ask 
the question: if we use the templates to fit only to some bands but leave out others, how well do the best 
fits predict the bands left out? 

Consider Figures 9 and 10. The former shows the color residuals of fitting to both SDSS and 2MASS 
data for each galaxy. The latter shows the color residuals when we fit only to the SDSS data and do 
not include 2MASS in the fit at all. Without any input from 2MASS the templates do a very good job of 
predicting the 2MASS fluxes, with a scatter of 20-30% — not far from the uncertainties in the 2MASS fluxes 
themselves. Of course, it is not surprising that it is easy to predict the 2MASS fluxes — stellar spectra are 
very simple in the NIR, so having the ugriz-hand fluxes from SDSS yields much information regarding the 
redder bandpasses. 

However, we will note here that this result flies in the face of a persistent insistence that NIR observations 

are necessary to measure stellar masses of galaxies. If wc can predict the NIR observations themselves, 
clearly they cannot be adding much to our knowledge of the underlying stellar mass. This fact changes 
one's decisions about what data is best to use for calculating the stellar mass function. Basing it on 2MASS 
data only improves slightly the stellar mass estimate of each galaxy, while restricting (and thus biasing) the 
sample significantly at the lowest luminosities and surface brightnesses relative to the SDSS. Of course, if at 
high redshift galaxies have a very different locus of their SEDs, then perhaps this statement is less true for 
them. 

In addition, consider Figures 11 and 12. The former shows the color residuals of fitting to both SDSS and 
GALEX data for each galaxy. The latter shows the color residuals when we fit only to the SDSS data and do 
not include GALEX in the fit at all. Without any input from GALEX the templates do a significantly worse 
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job at predicting the GALEX fluxes. The scatter becomes about tr ~ 0.5 mag. As is common knowledge, 
it is more difficult to predict the UV fluxes, because dust and recent star-formation are so variable among 
galaxies. However, the median residuals based on are our templates still near zero. 



4. Determining ii'-corrections 



Given a model spectrum for the galaxy, the determination of the X-correction is straightforward. Here 
we give the relevant formulae for the ii'-corrcctions, leaving the derivation to Hogg et al. (2002). Then, we 
show the typical X-corrcctions for the data wc have fit to. 

The JC-correction between a bandpass R used to observe a galaxy at redshift z and the desired bandpass 
Q is defined by the equation (Oke & Sandage 1968; Hogg et al. 2002): 



rriR = Mq + DM(z) + KqrIz) - Slogio h 



(13) 



where DM{z) = 25 — 51og;^o(<^-£'/('''~^ Mpc)) is the bolometric distance modulus calculated from the lumi- 
nosity distance dt, and Mq is the absolute magnitude. The absolute magnitude is defined as the apparent 
magnitude an object would have if were observed 10 pc away, in bandpass Q, at rest. The traditional 
definition of the _R'-corrcction takes Q = R. However, wc note that in practice many surveys do perform 
iC-corrections from one observed bandpass R to another bandpass Q in the rest frame. This practice is par- 
ticularly common when dealing with high redshift observations. In addition to /C-corrections, this method 
also provides an interpretation of the data in terms of a physical model which describes the stellar mass and 
star-formation history of each galaxy. 



Equation (13) holds if the iV'-correction Kqji is 



Kqr = -2.5 logi 



dA„ Xn L\ 



1 + z 



R{Xo) / dAeAe5r(^e)0(Ae) 



+ J dXo Xo g^iXo) RiXo) J dAe Ae LxiXe) Q(Ae) 



(14) 



Here, R{X) and Q{X) represent the response of the instrument per unit photon entering the Earth's at- 
mosphere (or the telescope aperture for a space instrument), is the flux density per unit wavelength 
(e.g. ergs s~^ cm~^ A~^) for the standard source for band R, and similarly for band Q. For example, if 
the magnitudes are AB relative, then these represent the AB standard source, while if they are AB relative 
then they represent the spectrum of Vega. 



A particularly common special case is when R = Q: 

dXo Xo Lx 



Kr{z) = -2.5 log 



10 



1 



Xo 

1 + z 



R{Xo) 



[1- 



/ 



dAe XeLx{Xe)R{Xe) 



(15) 



For example, consider Figures 13, 15 and 14. For a randomly selected set of galaxies these figures show 
the X-corrections from the observed frame bandpasses of GALEX, SDSS, and 2MASS respectively, to the 
same bandpasses in the rest frame. 

Often, it makes sense to ii'-correct to a different set of band passes than the rest frame. For example, 
the SDSS Main sample is mostly around z = 0.1, so it does not always make sense to ii'-correct from the 
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observed frame r band to the rest frame r band. Doing so means applying large and uneertain corrections 
to all galaxies. If precision is not important, this uncertainty might be worth the simplification it brings. 
However, if one is (say) interested in the evolution of galaxies, the clustering of galaxies, or something else 
that requires precision, it makes more sense to avoid introducing unnecessary uncertainty. 

Our solution to this problem is to correct the magnitude to blue-shifted bandpasses that correspond to 
the observed bandpass at some intermediate redshift. We denote these bandpasses ^b, where the blue-shift 
in this case is by a factor 1 + z. For example, in the case of SDSS galaxies it makes sense to correct to the 
°-V bandpass — the r band blue-shifted to z = 0.1. This bandpass has the property that the if-correction 
is independent of the SED of the galaxy. If the magnitudes are AB relative, then the /C-correction to ^ for 
a galaxy at redshift z is simply — 2.51ogio(l + z). 

Figure 16 shows the iiT-corrections from the [ugri^;] bands to the '^■^ugriz] bands as a function of redshift 
for SDSS galaxies. Note that the i^-corrections converge at z = 0.1, so that around that redshift (where 
most galaxies are) the uncertainties in the ii'-corrections are minimized. Compare this simplicity to Figure 
15, where the ii'-corrections at redshift z = 0.1 in the g'-band range between 0. and 0.2 (and in the u-band 
from 0. and 0.5). Most SDSS galaxies arc near redshift z = 0.1, and obviously the ii'-corrections are not 
perfect — why introduce a large uncertainty for most galaxies when it can easily be avoided? 

5. Physical interpretation of the models 

In addition to /^'-corrections, these template fits also provide physical interpretations of the galaxies, 

since they correspond to actual stellar population synthesis models. Indeed, the code outputs three param- 
eters relevant to the star-formation history: the current mass of stars in the galaxy; the stellar mass-to-light 
ratio of the galaxy; and the fraction of the total amount of star-formation that has occurred recently. 

Kauffmann et al. (2003) have also calculated stellar masses for SDSS galaxies. Roughly, they have 
calculated the ^-band stellar mass-to-light ratios of models which best fit the absorption and D4000 
measurements of the spectra, and then applied those mass-to-light ratios to the observed 2;-band luminosities 
of the imaging. Figure 17 shows a galaxy-by-galaxy comparison of their stellar masses M,.., relative to our 
Ma,, as a function of and of intrinsic g — r color. The two sets of masses are very similar to each other, 
with a scatter of only 0.1 dex and trends of less than 0.2 dex with stellar mass. To be explicit, the masses 
in Figure 17 correspond to the current mass in stars remaining in the galaxy. This mass can differ from the 
total star-formation rate integrated over time, due to the fact that stars can die — and in practice for our 
fits the total integrated star-formation rate is usually about twice as high as the total current mass in stars. 

Similarly, Figure 18 shows the mass-to-light ratios of SDSS galaxies in the V band as a function of color 
in B — V. Here, we have calculated the B and V band fluxes based on the template flts to each galaxy, as 
explained in Section 6 below. We give the mass-to-light ratio in solar imits. The solid line is the relationship 
that Bell & de Jong (2001) give based on their fits to spiral galaxies. For blue galaxies {B — V < 0.8) this 
line is indeed a good fit to our results. For the typical red galaxy, the Bell & de Jong (2001) relation predicts 
a larger stellar mass than we do. 

Finally, we can also calculate measures of the recent star- formation rate from these fits. Since our fits are 
to broad-band photometry, and not based on emission line measurements, we cannot expect to have detailed 
information about the very recent star-formation rate. However, we can try to measure what fraction of the 
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total star- formation has occurred in, say, the past 1 Gyr: 

diSFR(t) ^^^^ 

where the definition here is meant to be similar to the birthrate parameter b in Kennicutt et al. (1994) and 
references therein. Note that here be is defined in terms of the total integrated star- formation history (unlike 
the stellar masses referred to above) as is standard for the b parameter. Figure 19 shows the relationship 
between the rest-frame u — r color for SDSS galaxies and bo- For galaxies with u — r < 2.5 (redder than 
which galaxies are mostly reddened, edge-on disks) these two parameters follow a simple relationship: 

logio6<, = -0.55(M-r). (17) 

Note, however, the large scatter (about 0.3 dex at la). 

We do not want to overstate the validity of these physical interpretations. They are sufficiently good 
physical interpretations to explain broad band data, but are not uniquely so. Certainly more detailed 
spectroscopic analysis can provide better constraints on star-formation histories and stellar masses. 

One particular weakness for high redshift data is that the templates always assume that there is 14 Gyrs 
of star-formation history. Thus, kcorrect may greatly overestimate the stellar masses of galaxies at 1 
(for example), where the actual star- formation history must last only 6 Gyrs. 



6. Linear relationships between common magnitude systems 

The software and templates that we present here are also useful for determining simple conversions 
between bandpasses performing other common tasks, such as calculating the absolute magnitude of the Sun 
in various band systems, and calculating the conversion between Vega and AB magnitudes. In this section, 
we present these tools. 

For example, Table 1 lists the effective wavelengths, conversion from Vega to AB magnitudes, and 
absolute magnitudes of the Sun in a number of filters: the GALEX filters, the Bessell filters, the SDSS 
filters, and the 2MASS filters. The listed numbers are the results of running the IDL functions k_lambda_ef f , 
k_vega2ab, and k_solar .magnitudes, respectively, so the reader can calculate the same thing easily on any 
given filter. The effective wavelengths listed for each filter use the definition 



Aeff = exp 



/ rf(lnA)i?(A) In A 
Jd(ln\)R{\) 



(18) 



following Fukugita et al. (1996) and Schneider et al. (1983). For the spectrum of Vega in the conversion 
of AB to Vega magnitudes, we use the Kurucz (1991) theoretical Vega spectrum (normalized at 5000 A 
to match the Hayes (1985) spectrophotometry of Vega). In k_vega2ab, the user has the option to use the 
spectrum of Hayes (1985) instead. We use the solar spectrum of Kurucz (1991) for the calculation of the 
solar absolute magnitude. 

For comparison among results from different surveys, one also wants to be able to convert the magnitudes 
in one set of bandpasses to magnitudes in another set of bandpasses. An example of doing so is given in our 
piece of code sdss2bessell, which takes SDSS input magnitudes and outputs absolute magnitudes in the 
Bessell U, B, V, R and I bandpasses. 
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However, often one wants to just make quick and dirty comparisons, without going through the trouble 
of running any software. For these purposes, we provide Table 2, which provides the linear relationships 
among the same bandpasses as those listed in Table 1. In fact, these relationships are usually good to 0.05 
mag or better. Note that Table 2 refers to AB magnitudes throughout; use Table 1 to get the relationships 
to Vega magnitudes. 

7. Summary 

We have here described a method for fitting templates to nearly-arbitrary sets of spectra and broad-band 
fluxes. The basic concept is that we can recover a small set of templates (based on models), nonnegative 
linear combinations of which can explain a much larger set of inhomogeneous observations. We have applied 
this method to SDSS, GALEX, 2MASS, DEEP2, and GOODS data to come up with a set of templates 
which are effective for describing all of these data sets. 

Nonnegative matrix factorization (NMF) is an effective way to reduce the dimensionality of any large 
data set, and has that in common with PCA. It is, in some sense, a "nonnegative PCA." However, at 
least in the current context the method has several advantages over PCA. First, it has a natural physical 
interpretation associated with that spectral subspace, which is the corresponding subspace of all possible 
star-formation histories. Second, it naturally handles data uncertainties and missing data, which allows it 
to ignore that variation which is due purely to statistical errors. Third, it handles the complications of 
observing galaxy spectra photometrically using broad-band filters. 

We note here that the general NMF method does not depend on using a model — our "model" could 
just have been a set of top hat functions or Gaussians on a wavelength grid. In some situations, such as 
datasets for which there are not well-developed theoretical models, this approach could be more appropriate. 

We have released our results in the form of a templates and a code base called kcorrect which fits those 
templates to many types of data (SDSS photometric and spectroscopic data, GALEX data, GOODS data, 
DEEP2 data, and 2MASS data). Furthermore, this code returns /T-corrections and a physical interpretation 
of the photometry. All of the plots in this paper were created using code in the repository. The code is 
available from a web page maintained by one of the authors'' . This web page is kept updated on improvements 
in the code and new developments. It consists of a C language library, stand-alone C programs, and IDL 
language wrappers around the C library. Thus, one can use the basic templates and fitting c;odc using 
only a machine with a C compiler. However, there is significant functionality which is programmed in the 
(unfortunately, proprietary) IDL language. These routines depend on the idlutils library^. 
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A. Nonnegative matrix factorization (NMF) 
A.l. Standard NMF 

The standard non-negative matrix factorization (NMF) problem, as originally posed by Lee & Seung 
(2000), is to approximate a data matrix X (of size Nk x Nn) as the outer product of two rank-A^^ non- 
negative matrices W and H. For example, in this paper. X represents the fluxes at Nn different wavelengths 
for different galaxies, H represents a set of Ni templates to build each galaxy from, and W represents, 
for each galaxy, the weights to give each template. This factorization is similar in spirit to singular value 
decomposition (SVD) and principal components analysis (PCA) but crucially, in NMF, X, W and H are all 
constrained to have non- negative entries. This changes the problem fundamentally, since no "cancellation" 
of positive and negative basis functions or coefficients is possible - all approximation interactions are strictly 
additive. The goal, as with SVD or PCA, is to minmize the squared approximation error (Frobenius norm): 

X^ = \\X-WH\\'' (Al) 

= Y.[^kn-Y.^kiHin\ (A2) 

kn \ i / 

In their seminal papers, Lee & Seung (2000) show that this approximation error in non-increasing under the 
following very simple multiplicative update rules: 

H^^ ^ Hi^ , t^^^]'" (A4) 

[M]ab denotes the (ab) clement of the matrix valued expression M. is the transpose of matrix H . 

These rules arc remarkable because, although they make finite (not infinitesimal) adjustments to the 
elements of the approximation matrices, they have no step-size parameters and are always guaranteed to 
reduce the error (or leave it invariant once they have converged), is in general a "non-convex" function 
of W and H, meaning we cannot guarantee there is only one local minimum. Therefore, the procedure does 
not necessarily find the global optimum. But in practice, even with random initialization, these rules seem 
to converge to good solutions for real data (at least data like ours). 



A. 2. NMF in the space of coefficients for a known basis 

The factorization problem we face in this paper is a slight variation on the basic NMF setup described 
above. We wish to approximate X by the product ABM, where M is a given (fixed) basis matrix of size 
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Nj X Nn and the optimization is over the matrices A (of size Nk x A/j) and B (of size Ni x AT,). All matrices 
X,A^BM have non-negative entries. This can be thought of as performing non-negative matrix factorization 
on the coefficients of an approximation, given a fixed (non-negative) basis, described by the columns of M. 
Once again, the objective we wish to minimize is the squared approximation error: 

(A5) 
(A6) 

This equation is almost that posed in Equation 11. In the case of this paper, this generalization allows us 
to express the templates in terms of star-formation histories, but to compare the predicted fluxes for the 
star- formation histories through the M matrix to the observed fluxes X. 

Of course, in the very special case where M is an invertible matrix, this problem can be transformed into 
the original NMF problem above by right multiplying the data matrix by M~^. However, in most situations, 
including ours, M has many fewer rows than columns, and as such is far from invertible. Fortunately, 
however, it is possible to derive multiplicative updates for this extended problem which minimze the error 
directly, even when M is not invertible. 

First, by thinking of the tensor product BM as a single non-negative matrix H, we can trivially derive 
a multiplicative update equation for the elements of A by using the W update provided above: 

A.. A. [^^"^"1^- (A7) 
^'"'^"'[ABMM-B-U ^ ^ 

By Lee and Seung's original proof, this update is guaranteed (for any non-negative matrices X,M and B) 
not to increase the approximation error. 

Our main algorithmic contribution is to derive a similar update equation for the elements of B: 

[A'^XM^]ij 

^ [A^ABMM^]ij ^^^^ 

Following the technique outlined in Lee & Seung (2000), it can still be shown that the error is nonincreasing 
under the application of this update. 

The proof involves the use of an inequality lemma for symmetric non-negative matrices: 

Lemma: For any symmetric matrix P having non-negative entries Pni > 0, any vector z having non- 
negative entries Zn > 0, and any vector y, 

li 

Zn 



For the standard NMF problem, we can prove that the approximation error (Al) is non- increasing under 
(A3) by using the above lemma to construct a function (j^iW, Z) which is an upper bound on the cost yi^iW) 
for any non-negative matrix Z: 

X^{W) = WkiWkiHi„Hi„ - 2 J2 WkiXknHi^ + ^ (AlO) 

knii kni kn 

cPiW, Z) = Y, Wli^Hir^Hi^ - 2 J2 WkiXuuHir, + Xkn (All) 

knii kni kn 

^{W,Z)>x^{W) yWki>Q, Zki>Q ( A12) 
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with equality being achieved when Z = W. Figure 20 represents this definition schematically, showing the 
true function and (j). The trick of the method is to define </> such that it is equal to at W, greater than 
everywhere else, and is easily minimizable. Then one can use the minimum of (j) as the update, and be 
guaranteed that one's updates do not increase x^. 

The function ^ can be analytically minimized with respect to its first argument: 

(f){W* , Z) < (P{W, Z) MW.Z (A13) 

If we set Z = W, this becomes exactly (A3) and now we can easily prove the validity of this update rule: 

X^{W) = <P{W, W) > cf>{W*,W) > x^{W*) (A15) 

where the first inequality comes from the fact that W* minimizes (/)(•, W) with respect to its first argument 
and the second comes from the fact that 4'{W* , •) is a bound on x^{W*). The proof of validity for the update 
(A4) is analogous by symmetry. 

To prove the validity of our update (A8) for our new problem, we proceed in a similar fashion, using 
the lemma twice to construct consecutive upper bounds on the cost: 

X^(B) = Yl AkiAuiBijBisMjnMsn (A16) 

kni^js 

— 2 ^ ^ XhnAkiBijAdjn + ^ ^ ^kn 



knij 



4>{B,Z,p)= AuiAueBl^^M^^M^r. (A17) 

kniijs ^■^ 

— 2 XkriAkiBijMjn + 

knij kn 

ct>{B, Z, p) > x'(B) V Bij > 0, Zkj > 0, pij > (A18) 
with equality being achieved when Z = AB and (5 = B. 

Once again, the bound (j) can be analytically minimized with respect to its first argument: 

, Z, /3) < <I>{B, Z,(3) yB,Z,(3 (A19) 



Ekn,s Ak^Au^M^nMsn 



If we set Z = AB and p = B, this becomes exactly (A8) and we can now prove the validity of this update 
rule: 

X^B) = <^(B, AB, B) > cp{B\AB, B) > x^{B*) (A21) 

where the first inequality comes from the fact that B* minimizes <p{-, AB, B) with respect to its first argument 
and the second comes from the fact that (j>{B*, •, •) is a bound on x^{B*). The only condition we require is 
that the elements of the matrix MM~^ are all non-negative. 



A. 3. Nonuniform uncertainties 



When different entries Xkn of the data matrix have different uncertainties, the natural objective function 
is the weighted approximation error (which also corresponds to the negative log likelihood under a Gaussian 



- 19 - 



noise assumption): 



kn 



This case can easily be handled since during the update for W the elements of H arc; fixed and vice 
versa and the updates are guaranteed not to increase the cost for any nonnegative matrices. In particular, 
when updating W, we can rewrite the cost function as 



X^ = E( — -E^^^— 1 (A23) 

^ ' CTftn ^ CTkn I 



2 



kn 

yielding updates of the form: 



and similarly, when updating H, we can rewrite the cost function as 

\ 2 



kn 

yielding updates of the form: 



^ ' CTfcn ^ akn / 



tiin ^Uin\2^ —2 / 12 1^26) 

\ k kn J \^f; kn ) 



This argument can be equally applied to our extended model, yielding the final update equations which 
are actually implemented in the kcorrect code: 



A,, ^ Afc, I ^ XknMjnB,, / E ^''"-^-"^y-"^"'-^^' ) (A27) 

jn k'"' I \m,ljn 



Bij ^ Bij 2^ -2 / ^2 1^28) 



kn y \m.lkn ^''^ 



B. Format for templates 

We have fit several different sets of templates that we release with the code, which we denote: 

1. default, the default set of five templates, 

2. Irgl, the single template fit to LRGs, and 

3. goods, the five template fit to just the GOODS data. 



- 20 - 



The information about the default set of templates is contained in the file data/templates/k_iimf .derived, default .fits 
in the kcorrect project. This file has 25 Header and Data Units (HDUs), each listed in Table 3 and de- 
scribed in more detail in the paragraphs below. There are similar files for the Irgl and goods template 
sets. 

As described in Section 2.2, there are A^basis = 485 basis spectra, consisting of 450 different instantaneous 
burst stellar populations (A^age = 25 ages, -/Vmcts = 6 mctallicitics, and A^^ust = 3 dust properties) plus 35 
different emission line models. The templates HDU has the coefficients of the five templates in this basis 
space. 

For each of the five templates, we have the template spectrum spec in units of ergs s~^ cm~^ per 
solar mass, as it would be observed at 10 pc distance. The wavelength grid is in the lambda HDU. As noted 
above, the models are smoothed at 300 km s~^ resolution; the rawspec HDUs have the original (unsmoothcd) 
models from Bruzual & Chariot (2003). In addition, we give versions without the emission lines (with the jal 
suffix) and without the dust extinction applied (with the jid suffix). The actual emission-line-only spectra 
for each template are in the Ispec HDU. Finally, the multiplicative dust extinction factor for each template 
spectrum is given in the extinction HDU. 

We include the star- formation rate as a function of time in the sf r HDU (the ago grid used is in the 
ages HDU). The time differential used to quantify this rate is in the dage HDU; to get the total number of 
stars formed at each age, multiply sf r by dage. The average metallicity of the stars formed as a function of 
time is in the metallicity HDU. 

The total mass formed in each template (in solar masses) is in the mass HDU — this is actually just 
unity for each template. The total surviving stellar mass for each template is in the mremain HDU. The 
metallicity in the surviving stars is given in the mets HDU. The total mass in stars formed in the last 300 
Myrs is in the mSOO HDU, and the total mass formed in the last 1 Gyr is in the mlOOO HDU. 

The properties of the 450 stellar population basis vectors are given in the four HDU: basis_ages, 
basisjnets, basis_dusts, and basisjnremain. Respectively, these give the ages, metallicities, dust prop- 
erties, and fraction of original stellar mass remaining for each instantaneous burst in the grid. The dust 
properties are given in terms of a structure with four elements which refer to the properties in the multiple 
scattering model of Witt & Gordon (2000): 

1. GEOMETRY: the geometry of the dust (e.g. "shell") 

2. DUST: the dust extinction curve (e.g. "MW" for Milky Way extinction and "SMC" for Small Magellanic 
Cloud type extinction) 

3. STRUCTURE: structure of the dust distribution ("h" for homogeneous, "c" for clumpy) 

4. TAUV: the amount of dust (the total optical depth in the V band) 
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Table 1. Properties of various filters 
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3546. 


0.91 


6.38 


5.47 


9 


4670. 


-0.08 


5.12 


5.20 


r 


6156. 


0.16 


4.64 


4.49 


i 


7472. 


0.37 


4.53 


4.16 


z 


8917. 


0.54 


4.51 


3.97 


J 


12355. 


0.91 


4.56 


3.65 


H 


16458. 


1.39 


4.71 


3.32 




21603. 


1.85 


5.14 


3.29 




3224. 


1.25 


6.78 


5.53 


O.lg 


4245. 


-0.01 


5.43 


5.44 




5597. 


0.04 


4.76 


4.71 




6792. 


0.27 


4.57 


4.30 


0.1^ 


8107. 


0.46 


4.52 


4.05 



Note. — Uses model solar spectrum and model Vega spec- 
trum of Kurucz (1991). Effective wavelength is defined in the 
text. The UBRVI filters are those of Bessell (1990). The 
ugriz filters are those determined by Mamoru Doi, Daniel 
Eisenstein, and James Gunn and available on the SDSS DR4 
web site.^ The JHKg filters are those from Cohen et al. 
(2003). 



- 22 - 





Table 2. Conversions among various 


filters 








EcjuMtiou 




dispell- 


sioii 


u = 


O-^M - 0.3310 - 0.3014 [(O iu -°-'^g) - 1.2839] 


afO-^-O-i^] 


— 


0.28 


9 = 


o-i^ - 0.3112 - 0.3530 - 0.7187] 




'•V] 




0.20 


r =' 


°-ir - 0.2026 - 0.3810 [(° V -^-^i) - 0.3530] 


a[0-V-0-ii] 




0.07 


i =°-H- 0.1072 - 0.4273 \{°-H -°-^z) - 0.1914] 


(7[°-^-°-i0] 


— 


0.13 


z = 


o-^i - 0.3161 - 1.2057 [(°-^ -°-'^z) - 0.1914] 






0.13 


u = 


C/ + 0.0682 + 0.0197 [{U - B) - 0.9602] 




-B] 


— 


0.20 


9 = 


B - 0.2354 - 0.3411 [{B -V)- 0.5870] 


a[B- 


-V] 


— 


0.18 


r = 


V - 0.2585 - 0.5003 [{V - R) - 0.3161] 


a [V - 


-R] 




0.07 


i = 


R - 0.2000 - 0.4248 [{R -I)- 0.2652] 


a[R 


-I\ 


— ■ 


0.12 


z = 


i? - 0.4088 - 1.2495 [{R -I)- 0.2652] 


a[R 


-I] 


— 


0.12 




= U + 0.3989 + 0.4135 [{U ~ B) - 0.9602] 


a[U - 


-B] 


— 


0.20 


O.lg 


= [/ - 0.8845 - 0.9508 [{U - B) - 0.9602] 


a[U- 


-B] 


— 


0.20 




= B + 0.0759 + 0.0620 [(B -V)- 0.5870] 


a[B- 


-V] 




0.18 


O.lj, 


= S - 0.6429 - 1.0845 [{B -V)- 0.5870] 


a[B- 


-V] 


— 


0.18 


0.1^ 


= y - 0.0558 - 0.1803 [{V - R) - 0.3161] 


a [V - 


-R\ 


— 


0.07 


0.1^ 


= R- 0.0927 + 0.0035 [{R -I)- 0.2652] 


a[R 


-I] 


— 


0.12 


0.1^ 


= i?, - 0.2841 - 1.0301 [{R -I)- 0.2652] 


a[R 


-I] 




0.12 


O.ly 


= u + 0.3310 + 0.3203 [(u - g) - 1.2638] 


a [u 


-9] 




0.26 


0.1^ 


= w - 0.9528 - 0.7572 [{u - g) - 1.2638] 


a [u 


-9] 


= 


0.26 


0.1^ 


= g + 0.3113 + 0.4620 [{g - r) - 0.6102] 




-r] 


— 


0.15 


0.1^ 


= 5 - 0.4075 - 0.8577 [{g - r) - 0.6102] 




-r] 




0.15 




= r - 0.1504 - 0.3654 [(r -i)- 0.2589] 


a [r 


-^] 




0.10 


0.1^ 


= i - 0.0836 - 0.7518 [{i - z) - 0.2083] 


a [i 


-A 




0.10 


u = 


= u - 0.0682 - 0.0140 [{u - g) - 1.2638] 


a [u 


-9\ 




0.26 


B = 


= u ~ 1.0286 - 0.7981 [{ti - g) - 1.2638] 


a \u 


-9\ 




0.26 


B = 


= g + 0.2354 + 0.3915 [{g - r) - 0.6102] 


^[9 


-A 




0.15 


V = 


= g - 0.3516 - 0.7585 [{g - r) - 0.6102] 




-A 




0.15 


R = 


= r - 0.0576 - 0.3718 [(r - i) - 0.2589] 


a [r 






0.10 


1 = 


z - 0.0647 - 0.7177 [{i - z) - 0.2083] 


a [i 


-A 




0.10 


U = 


=°-^u - 0.3992 - 0.3189 [(0 \t - 1.2839] 


(j[0-\.-0-i5] 




0.28 


B = 


=0-15 - 0.0759 - 0.0545 V - 0.7187] 


a[0-V-°-V] 




0.20 


V = 


=01 c, - 0.6628 - 0.9259 [(O ig -° ir) - 0.7187] 


a["-ig-0-ir] 




0.20 


R = 


=01 r - 0.2603 - 0.9162 [{°\ - 0.3530] 


(j[0-V-0-ii] 




0.07 


1 = 


- 0.1725 - 0.9718 [{^-H -^-^z) - 0.1914] 


a[0-^-0-iz] 




0.13 



Note. — Uses AB magnitudes throughout. 
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Table 3. Header and Data Units (HDUs) of k_miif .derived, default .fits 



Nuuilx^r 






Dim<'iisi( )iis 


Dcscriijtiou 





templates 




A^basis X Nt 


Coefficients in basis space for each template 


1 


spec 




Nspec X Nt 


Smoothed spectrum of each template 


2 


specjil 




A^spoc X Nt 


Smoothed spectrum of each template, lines removed 


3 


specjid 




Nspec X Nt 


Smoothed spectrum of each template, without dust 


4 


specjiljid 




-/Vspec X Nt 


Smoothed spectrum of each template, without lines or dust 


5 


rawspec 




Nspec X Nt 


Unsmoothcd spectrum of each template 


6 


rawspecjil 




Nspec X Nt 


Unsmoothed spectrum of each template, lines removed 


7 


rawspecjid 




N spec X N t 


Unsmoothed spectrum of each template, without dust 


8 


rawspecjiljid 




Ngpec X Nt 


Unsmoothed spectrum of each template, without lines or dust 


9 


Ispec 




iVspoc X Nt 


emission line spectrum 


10 


extinction 




Nspec X Nt 


extinction as a function of wavelength 


11 


lambda 




N 

-''spec 


Wavelength grid (A) 


12 


sfr 




iVage X Nt 


Star-formation rate (Mq per year) as a function of time 


13 


metallicity 




N^se X Nt 


Average metallicity as a function of time 


14 


ages 




N 

-''age 


Age grid (yrs) 


15 


dage 




N 

-''age 


Age bin size (yrs); sfr x dage = total star-formation in each bin 


16 


mass 




Nt 


total mass formed in each template 


17 


mremain 




Nt 


total current stellar mass in each template 


18 


mats 




Nt 


average metallicity of current stars in each template 


19 


m300 




Nt 


total stellar mass formed in last 300 Myrs in each template 


20 


mlOOO 




Nt 


total stellar mass formed in last 1 Gyr in each template 


21 


basis_ages 


N 


X A^'mets X TVdust 


ages of each stellar population basis vector 


22 


basisjnets 


N 


X iVmets X iVdust 


metallicity of each stellar population basis vector 


23 


basis_dusts 


-^age 


X -A^mets X A^dust 


dust properties of each stellar population basis vector 


24 


basisjnremain 


-^age 


X -A^mets X A^dust 


fraction of original stellar mass surviving for each basis vector 



Note. — All quantities are in floating point (four byte) precision, except dusts which is an array of structures, whose 
format is described in text. 
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1x10 



■11 



°^ 8x10"^^ 



Cv2 

I 



g 6x10 



■12 



4x10"^^ 
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2000 4000 6000 8000 10000 
X (Angstroms) 



Fig. 1. — Best fit LRG spectral template in the rest frame (normalization is for a 1 Mq galaxy located 10 
pc away, or equivalently a 10^^ Mq galaxy located 10 Mpc away). 
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10^ 10^ 10^ 10^ 10^ 

lookback time (yrs) 



Fig. 2. — Star- formation history corresponding to LRG spectral template of Figure 1. Top panel shows the 
number of stars formed per logarithmic time interval (t is expressed in years, curve is normalized for a 10^^ 
M© galaxy). Almost all of the stars are formed in the first couple of billion years — note that the recent 
"spike" is represents a tiny fraction (~ 10~^) of the total number of stars. Bottom panel shows the mean 
metallicity of the population as a function of time. Note that the details of these functions are rather poorly 
constrained. 
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Fig. 3. — SDSS colors of LRGs as a function of redshift. The grayscale is the conditional distribution of color 
within each redshift bin. The thin lines are the 10%, 25%, 50%, 75%, and 90% quantiles of the distribution. 
The thick line is the prediction of the model. The u band is not included in the fit, and the u magnitudes 
of most LRGs are poorly known. The other colors fit the models reasonably well. This model, remember, is 
given incredible freedom, meaning that the above agreement is the best one can do with the stellar population 
synthesis code of Bruzual & Chariot (2003). 
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Fig. 4. — Similar to Figure 2, but for the five global templates. Again, there are many degeneracies in 
the fit parameters (though there are fewer in the actual spectra), so these figures need to be interpreted 
appropriately. 
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Fig. 5. — Color residuals (defined explicitly in the text) of GALEX, SDSS, and 2MASS observations relative 
to our best fit 5-teniplate model. The greyscale is the conditional distribution of the color residual given 
the redshift. The thin hues are the 10%, 25%, 50%, 75%, and 90% quantiles of the distribution. The thick 
dashed lines show the estimated la uncertainties in the colors from the photometric catalogs. Relative to 
the uncertainties, there are no significant biases or redshift trends in these fits. 
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Fig. 6. — Color residuals in fit using the standard five templates to GOODS data, compared to the typical 
uncertainties (thick dashed lines). Note that the fits always do poorly on the H band, which we believe to 
be a catalog error. 
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Fig. 7. — Same as Figure 6, but fitting using five templates specially designed for GOODS. These templates 
have smaller residuals in many respects but still fail to fit the H band data. 
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Fig. 8. — Best fit model spectra based on the five template fit to g, r and i fluxes, compared to the original 
SDSS spectra from which we computed those fluxes. The models and the original spectra agree very well. 
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Fig. 9. — Similar to Figure 5 but for galaxies observed in both SDSS and 2MASS and only using SDSS and 
2MASS bands. The fits are to the SDSS and 2MASS data together. 
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Fig. 10. — Similar to Figure 9 but now the fits are only to the SDSS bands. The residuals in the 2MASS 
bands remain very small, indicating that the 2MASS measurements do not add a lot of information about 
these galaxies. 
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Fig. 11. — Similar to Figure 9 but for galaxies observed in both SDSS and GALEX and only using SDSS 
and GALEX bands. The fits are to the SDSS and GALEX data together. 
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Fig. 12. — Similar to Figure 11 but now the fits are only to the SDSS bands. 
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Fig. 13. — iC-corrections as a function of redshift in the GALEX near (N) and far (F) UV bands. 
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Fig. 15. — i4r-corrections as a function of redshift in the SDSS u, g, r, i, and z bands. 
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Fig. 16. — Same as Figure 15, but ii'-correcting to the ^'^g, ° V, '^'H, and °'^2; bands. 
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Fig. 17. — Our galaxy stellar mass estimates SmpSed to those of KaufFmann et al. (2003) Mg^*, as a 
function of stellar mass (top panel) and of color (bottom panel). The greyscale is the conditional distribution 
Mg,*/M» on each quantity . The lines are the 10%, 25%, 50%, 75% and 90% quantiles. 
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B-V 

Fig. 18. — Mass-to-light ratios of galaxies in the V band (in solar units) as a function of galaxy B — V color. 
The solid line satisfies the relationship logjQ(Af/iy) = l-40(g — r) — 0.73, given by Bell & de Jong (2001) 
for their sample of spiral galaxies. Their estimates and ours agree for B — < 0.8, where spiral galaxies 
dominate the galaxy population. 
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Fig. 19. — Fraction of the total star-formation that has occurred in the previous biUion years be, as a 
function of rcstframe u ~ r color, for SDSS galaxies. The greyscale is the conditional distribution of be on 
u ~ r. The lines are the 10%, 25%, 50%, 75% and 90% quantilcs. For u — r < 2.5, the median relationship 
follows the simple form listed in the figure. Galaxies with u — r > 2.5 are often highly reddened star-forming 
galaxies. 
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Fig. 20. — Example diagram of definition of (j) in the text, represents the actual while is designed 
such that it exceeds except at the single point W. By minimizing (j) we can find a point W* that we know 
to have an equal or better than that at W. 



